MaltParser and LIBLINEAR Transition-based dependency parsing with linear classification for feature model optimization

نویسنده

  • Sofia Cassel
چکیده

In this thesis, MaltParser has been extended with an interface to a software for large-scale linear classification (LIBLINEAR). This combination was then used for learning and parsing with four different treebanks (Slovene, Danish, Arabic, and Turkish). The initial tests compared the accuracy of MaltParser using LIBLINEAR to that when using LibSVM (MaltParser’s default classifier) with its linear algorithm. The results are significantly different in favor of LibSVM for two of the four treebanks (Danish and Arabic). The LIBLINEAR classifier was then used for optimizing a feature model for each of the four treebanks. These results showed a significant improvement compared to the baseline feature model both when using only LIBLINEAR, and when using the optimized feature model with LibSVM. Learning and parsing times are much smaller when using LIBLINEAR, though accuracy is not quite as high as that of LibSVM’s polynomial kernel. In conclusion, using LIBLINEAR for feature model optimization is a good idea, since it takes advantage of the speed of LIBLINEAR while not sacrificing accuracy when using LibSVM’s polynomial kernel for the final feature model. This also indicates that the MaltParser/LIBLINEAR combination should be explored further, and preferably with larger treebanks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MaltOptimizer: A System for MaltParser Optimization

Freely available statistical parsers often require careful optimization to produce state-of-the-art results, which can be a non-trivial task especially for application developers who are not interested in parsing research for its own sake. We present MaltOptimizer, a freely available tool developed to facilitate parser optimization using the open-source system MaltParser, a data-driven parser-g...

متن کامل

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

متن کامل

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

We introduce MaltParser, a data-driven parser generator for dependency parsing. Given a treebank in dependency format, MaltParser can be used to induce a parser for the language of the treebank. MaltParser supports several parsing algorithms and learning algorithms, and allows user-defined feature models, consisting of arbitrary combinations of lexical features, part-of-speech features and depe...

متن کامل

MaltParser at the EVALITA 2009 Dependency Parsing Task

This paper describes our participation in the EVALITA 2009 Dependency Parsing Task with a version of MaltParser. Reusing feature models developed in the CoNLL shared task 2007, we evaluated four different parsing algorithms implemented in MaltParser and found that the best results were achieved with Covington’s non-projective parsing algorithm. In the final evaluation, our system finished third...

متن کامل

An Ensemble Model for the EVALITA 2011 Dependency Parsing Task

This paper describes our participation in the EVALITA 2011 Dependency Parsing Task with an ensemble model. In the 2009 edition we participated with MaltParser, reusing feature models developed in the CoNLL shared task 2007. In 2011, we compared the results obtained by different parsing algorithms implemented in MaltParser with an ensemble model made available by Mihai Surdeanu. The best results...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009